| 1. | Classified study on inconsistency of segment for chinese corpus 中文语料库分词不一致的分类处理研究 |
| 2. | On construction of a chinese corpus bused on semantic dependency relations 基于语义依存关系的汉语语料库的构建 |
| 3. | It implies that the method of svm + tbl to the corpus of conll2000 and chinese corpus of our definition 最后本文把svm + tbl的方法应用在conll2000英文语料和我们定义的中文语料上。 |
| 4. | Two kinds of knowledge sources are explored : one is expert knowledge and the other is a small dialectal chinese corpus 方言相关的知识源有两种,一是专家,一是小规模的方言背景普通话数据库。 |
| 5. | Juang der - ming , hsieh ching - chun , and lin hsih have pointed out the extent of difficulties and how they needed to be addressed , particularly in ancient chinese corpora 庄德明、谢清俊和林晰指出了该问题的复杂性和困难所在? ?尤其是对于汉语古籍语料。 |
| 6. | The institute of computational linguistics , peking university has completed the basic processing of a contemporary chinese corpus that has 27 million chinese characters 摘要北京大学计算语言学研究所已经完成了一个有2700万汉字的现代汉语语料库的基本加工。 |
| 7. | Characters missing as a result of code change , particularly in processing classical chinese corpora , is the major hurdle in any large - scale chinese language processing 用计算机处理汉字资料时,常有些字的字形是交换码中没有的,这情形在古籍中特别严重。 |
| 8. | Sun maosong and zuo zhengping have presented a word segmentation algorithm based on a large chinese corpus . the approach may be beneficial to understanding unrestricted chinese texts 他们给出了一个基于大规模语料的歧义切分算法,该方法有助于理解非受限中文文本。 |
| 9. | We present an extensive experimental evaluation of refined concept index on two english collections and one chinese corpus using state - of - the - art support vector machine classifier 因此,在大规模文本分类应用中,特征选择算法往往更受欢迎。不过,概念索引却是一个例外。 |
| 10. | Now , we have written a lot of software for chinese corpus processing , and have gained great achievements . but the outcome of them cannot answer our needs very well , and needs further improvements 当前对汉语语料的加工结果,虽已取得了一定的成绩,但国家的评测结果表明,其离实际需要的差距还是很大的,还有待于进一步的提高。 |